(54) System and method for reducing the footprint of preloaded classes



(57) A method and system that reduces the ROM space allocated for internal data structures by a runtime
engine (such as the Java virtual machine). The internal data structures store member information for preloaded
classes used by applications executed by the runtime engine. The system determines the different types of
internal data structures represented in the classes and identifies the possible values of each type's members.
The system next determines the amount of space required to store the values for each type in a respective
value table and the number of bits needed to index each entry of that table. The system determines, based on the
stored information, whether occurrences of a member are optimally represented as a set of value table indices
and a value table or, in the conventional manner, as a general variable that stores the member's value for each
occurrence. This determination is based on the size of the general variable, the number of occurrences of the
member, the memory needed for each index and the size of the value table.

Description

[0001] The present invention relates generally to a class preloader and, particularly, to a system and method for
reducing the size in read only memory of preloaded Java classes.

BACKGROUND OF THE INVENTION 

[0002] A Java program comprises a number of small software components called classes. Each class contains code
and data and is defined by information in a respective class file. Each class file is organized according to the same
platform-independent "class file format". Referring to FIG. 1, there is shown a block diagram of the class file format,
according to which each class file 400 includes header information 402, a constant pool 404, a methods table 406 and
a fields table 408. The header information 402 identifies the class file format, the size of the constant pool, the number
of methods in the methods table 406 and the number of fields in the fields table 408. The constant pool 404 is a table
of structures representing various string constants, class names, field names and other constants that are referred to
within the class file structure and its sub-structures. The methods table 406 includes one or more method structures,
each of which gives a complete description of and Java code for a method explicitly declared by the class. The fields
table 408 includes one or more field structures, each of which gives a complete description of a field declared by the
class. An example of the fields table 408 is now described in reference to FIG. 1B.

[0003] A Java program is executed on a computer containing a program called a virtual machine (VM), which is
responsible for executing the code in Java classes. It is customary for the classes of a Java program to be loaded as
late in the program's execution as possible: they are loaded on demand from a network server or from a local file
system when first referenced during the program's execution. The VM locates and loads each class, parses the class
file format, allocates internal data structures for its various components, and links it in with other already loaded classes.
This process makes the method code in the class readily executable by the VM.

[0004] For small and embedded systems for which facilities required for class loading, such as a network connection,
a local file system or other permanent storage, are unavailable, it is desirable to preload the classes into read only
memory (ROM). One preloading scheme is described in U.S. Patent Application Serial No. 08/655,474 ("A Method
and System for Loading Classes in Read-Only Memory"), which is entirely incorporated herein by reference. In this
method and system, the VM data structures representing classes, fields and methods in memory are generated offline
by a class preloader. The preloader output is then linked in a system that includes a VM and placed in read-only
memory. This eliminates the need for storing class files and doing dynamic class loading.

[0005] Referring to FIG. 2A, there is shown a more detailed block diagram of the VM data structures 1200 generated
by the class preloader. The data structures 1200 include a class block 1202, a plurality of method blocks 1204, a
plurality of field blocks 1214 and a constant pool 1224.

[0006] The class block 1202 is a fixed-size data structure that can include the following information:

• the class name 1230;

• a pointer 1232 to the class block of the current class's immediate superclass;

• a pointer 1234 to the method blocks 1204;

• a pointer 1236 to the field blocks 1214; and

• a pointer 1238 to the class's constant pool.

[0007] The elements of a class block data structure are referred to herein as class block members.

[0008] A method block 1204 is a fixed-size data structure that represents one of the class's methods. The elements
of a method block data structure are referred to herein as method block members. A field block 1214 is a fixed-size
data structure that represents one of the class's instance variables. The elements of a field block data structure are
referred to herein as field block members.

[0009] Each type of VM data structure, including the class block 1202, method blocks 1204, field blocks 1214 and
constant pool 1224, has a format defined by a corresponding data structure declaration. For example, a single method
block declaration defines the format of all method blocks 1204. The data structure declarations also define accessor
functions (or macros) that are used by the VM to access data structure members. These data structure declarations
are internal to the VM and are not used by class components. The prior art data structure declarations are now described
in reference to FIG. 2B.

[0010] Referring to FIG. 2B, there is shown a depiction of data structure declarations 1230 that define the format of
all data structure types employed by a particular VM. Each declaration 1230 includes a set of member declarations
1232 and accessor functions 1234 for accessing respective members. The member declarations 1232 and accessor
functions 1234 are defined conventionally according to the syntax of the language used in the implementation of the
VM. For example, assuming the C language is used in the data structure declarations 1230, a generic field data structure
1230.N (shown in FIG. 2B) could be defined as a structure T with five members of the following types with respective
accessor functions:





member name    member type    accessor functions

member1        mtype1         mem1 of (T)    T->member1
member2        mtype2         mem2 of (T)    T->member2
member3        mtype3         mem3 of (T)    T->member3
member4        mtype4         mem4 of (T)    T->member4
member5        mtype5         mem5 of (T)    T->member5

[0011] In this example, the member types can be any type defined by the relevant computer language, including
user defined types or language types, such as integer, float, char or double. The accessor functions are macros used
by the VM to access the fields without needing to access directly the structure containing the field. For example, instead
of employing the expression "T->member1" to access member1 in structure type T, the VM need only employ the expression
"mem1 of (T)". Accessor functions are well known in programming languages, such as C, that provide sophisticated
data structure capabilities.
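The conventional declaration described above can be sketched in C as follows. This is a minimal illustration, not the declaration from FIG. 2B: the concrete types chosen for mtype1 through mtype5, the macro spelling mem1of (the text writes "mem1 of (T)"), and the helper sum_int_members are all assumptions made for the example.

```c
/* Hypothetical concrete types; the text only names them mtype1..mtype5. */
typedef int    mtype1;
typedef float  mtype2;
typedef char   mtype3;
typedef double mtype4;
typedef int    mtype5;

/* Generic data structure T: every member is stored as a full variable. */
struct T {
    mtype1 member1;
    mtype2 member2;
    mtype3 member3;
    mtype4 member4;
    mtype5 member5;
};

/* Accessor macros: VM code names the macro, never the field itself. */
#define mem1of(t) ((t)->member1)
#define mem2of(t) ((t)->member2)
#define mem3of(t) ((t)->member3)
#define mem4of(t) ((t)->member4)
#define mem5of(t) ((t)->member5)

/* Illustrative VM-side code that touches T only through the accessors. */
static int sum_int_members(const struct T *t) {
    return mem1of(t) + mem5of(t);
}
```

Because callers depend only on the macro names, the layout of struct T can later change without touching code such as sum_int_members.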

[0012] The internal data structures used to store "class meta data" (i.e., the class, method and field blocks 1202,
1204, 1214) require large, fixed amounts of space in read-only memory. In fact, measurements indicate that this sort
of class meta data often takes up much more space than the bytecodes for the class methods themselves. These
internal data structures are therefore often unsuitable for use in small, resource-constrained devices in which class
preloading is desirable and/or necessary.

[0013] Moreover, if the internal data structures were individually modified to save memory space, the VM code would
need to be extensively revised to enable the VM to correctly access the modified data structures. To make such changes
to the VM could be onerous and inefficient.

[0014] Therefore, there is a need for a modified representation of the internal data structures that is smaller in size
than the prior art data structures, includes all information required by the VM, and does not require extensive or onerous
modification of the VM code.

SUMMARY OF THE INVENTION

[0015] In summary, the present invention provides a method and system that reduces the ROM space required for
preloaded Java classes.

[0016] In particular, the method and system of the present invention are based upon the realization that, in an
environment where the Java VM classes are preloaded, it is highly likely that the VM would be a closed system with a set
number of classes and class components, such as fields and methods. Such a closed VM would include a fixed number
of internal data structures, such as class blocks, method blocks and field blocks. Moreover, each member of these
data structures (e.g., a method block or field block member) would have one of a well-known set of distinct values.

[0017] Given this assumption and its implications, embodiments of the present invention reduce the memory space
required to represent the internal data structures by:

1) determining distinct values of each type of data structure member;

2) determining occurrences of each data structure member type (e.g., each occurrence in the method blocks of a
field block member type) and each occurrence's value;

3) determining memory space that would be saved if each occurrence were represented as an index to a table of
values of the data structure member type rather than conventionally (storing the value for each occurrence in a
general variable);

4) if sufficient savings would result, allocating a value table containing the distinct data structure member type
values and configuring each occurrence of that field block member type as an index to the appropriate value table
entry; and

5) generating new sources to the VM so that its access to the modified structures is adapted automatically.

[0018] In a preferred embodiment, the decision is made to represent a data structure member type as a value table 
index plus a value table if the following comparison is true: 

(#occurrences of type) x (size of index) + (size of value table) < 
(#occurrences of type) x (size of general variable). 
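
The comparison above can be written as a small C predicate. This is only a sketch of the stated inequality: the function name use_value_table and the choice to measure every size in bits are assumptions for the example, not details given in the text.

```c
#include <stdbool.h>
#include <stddef.h>

/*
 * Returns true when representing a member type as per-occurrence indices
 * plus one shared value table is strictly smaller than storing a general
 * variable for every occurrence. All sizes are in bits (an assumption).
 */
static bool use_value_table(size_t occurrences, size_t index_bits,
                            size_t table_bits, size_t general_bits) {
    return occurrences * index_bits + table_bits
         < occurrences * general_bits;
}
```

For instance, with 1000 occurrences, a 2-bit index, a table of four 32-bit values (128 bits) and a 32-bit general variable, the left side is 2128 bits and the right side is 32000 bits, so the table form is chosen. With only 2 occurrences and a 64-bit table, the table form costs more than it saves and the conventional form is kept.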
[0019] Once the present method has determined for each data structure member type whether an occurrence of that
type is to be represented as an index into a value table or as a general variable storing the value, the present method
emits appropriate information for that type, including accessor functions, language declarations and source code that
initializes the value tables. The accessor functions are macros through which all runtime access to the data structure
members is accomplished by the VM. Preferably, prior to emitting the above-described information, the present method
determines the most compact arrangement of the value table indices and conventional representations of members, and
then generates the value tables, value table indices, accessor functions and classes accordingly.

[0020] The present method emits accessor functions, declarations and other data structure information after
determining whether to modify the conventional representation of the data structure members. As a result, all emitted data
structure information is consistent with changes in the internal class representation. This automatic generation of
consistent information minimizes the changes to the VM that are required whenever new classes are added or the
class representations change. This provides a significant improvement over the prior art.

[0021] Embodiments of the system of the present invention include a collection of class files, a Java class preloader
in which the above method is implemented and output files generated by the preloader, including preloaded classes,
header files and source code files.

[0022] The class files define the complete set of classes to be preloaded. The preloader performs a first pass on the
class files to determine the:

different types of members of the internal data structures,
distinct values of each type of member,
amount of space required to store the values,
size of the value indices, and
number of occurrences of each member type.

[0023] The preloader then performs a second pass on the class files and the internal data structures to determine
how each member is to be represented, conventionally or as an index to a value table entry, and then emits the
appropriate output files.
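
The quantities gathered in the first pass can be sketched in C for a single member type. This is an illustration under assumptions: member values are modelled as 32-bit words, the names member_stats, bits_for_entries and gather_stats are hypothetical, and the quadratic distinct-value search is chosen for brevity rather than taken from the text.

```c
#include <stddef.h>
#include <stdint.h>

/* First-pass statistics for one member type (hypothetical layout). */
struct member_stats {
    size_t   occurrences;  /* number of occurrences of the member type */
    size_t   distinct;     /* number of distinct values observed */
    unsigned index_bits;   /* bits needed to index the value table */
    size_t   table_bits;   /* space needed to store the distinct values */
};

/* Smallest bit width that can address n table entries (at least 1). */
static unsigned bits_for_entries(size_t n) {
    unsigned bits = 1;
    while (((size_t)1 << bits) < n)
        bits++;
    return bits;
}

/*
 * Scans all occurrence values of one member type.
 * `scratch` must have room for n values; on return its first
 * `distinct` slots hold the distinct values in first-seen order.
 */
static struct member_stats gather_stats(const uint32_t *values, size_t n,
                                        size_t value_bits,
                                        uint32_t *scratch) {
    struct member_stats s = {n, 0, 0, 0};
    for (size_t i = 0; i < n; i++) {
        size_t j = 0;
        while (j < s.distinct && scratch[j] != values[i])
            j++;
        if (j == s.distinct)           /* value not seen before */
            scratch[s.distinct++] = values[i];
    }
    s.index_bits = bits_for_entries(s.distinct);
    s.table_bits = s.distinct * value_bits;
    return s;
}
```

With this helper, 400 distinct values need a 9-bit index, 200 need 8 bits and 1500 need 11 bits.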

[0024] The output files are compatible with similar files employed by conventional Java systems. That is, the
preloaded classes can be assembled or compiled into class object data and the header files and source files can be
compiled with VM sources into VM object data. The VM and class object data can then be linked in the conventional
manner into the executable VM for a particular Java environment.

BRIEF DESCRIPTION OF THE DRAWINGS 

[0025] Additional objects and features of embodiments of the invention will be more readily apparent from the fol- 
lowing detailed description and appended claims when taken in conjunction with the drawings, in which: 

FIG. 1 illustrates the class file format common to the prior art and embodiments of the present invention;

FIG. 2A is a block diagram of the VM data structures used in the prior art to encode class, method and field
information;

FIG. 2B illustrates the data structure declarations that define the format of the VM internal data structures shown
in FIG. 2A;

FIG. 3 is a block diagram of a distributed computer system in which the class preloader system and method of the
present invention can be embodied;

FIG. 4 is a block diagram of an execution engine in the distributed computer system of FIG. 3 in which the preloaded
classes generated by the class preloader of FIG. 3 are loaded into ROM;

FIG. 5 is a flow diagram illustrating the processing components used to produce the preloaded executable module; 

FIG. 6 is a flow diagram illustrating the processing components used to reduce the memory footprint of the
preloaded executable module;

FIG. 7A illustrates the organization of the updated header file 614 of FIG. 6;

FIG. 7B illustrates the organization of the value table 616 of FIG. 6;

FIG. 8A illustrates the organization of same member occurrences and values after allocation in the execution
engine ROM 208 in accordance with embodiments of the present invention;

FIG. 8B illustrates the organization of same member occurrences after allocation in the execution engine ROM
208 in accordance with the prior art;

FIG. 9A illustrates the compact organization of a data structure instance with five members generated by
embodiments of the present invention;

FIG. 9B illustrates the organization of the data structure instance from FIG. 9A generated by the prior art;

FIG. 10 is a flow chart of the method used by the class preloader to build the internal data structures used in the
preloaded classes; and

FIG. 11 is a block diagram showing the mapping of a preloaded application into read-only memory and random-
access memory and indicating the loading of the portion of the methods and data mapped into random-access
memory by a static class initializer.

DESCRIPTION OF THE PREFERRED EMBODIMENT 

[0026] The method and system described herein are directed to a Java class preloader configured to output
preloaded Java classes that are optimized for storage in the ROM of a target computer (referred to herein as the execution
engine). Given that the execution engine is likely to be a computer with little or no secondary storage, it is preferable
that the Java class preloader be implemented in a separate computer, which shall be referred to herein as a server.
Assuming such a configuration, the preloaded classes could be transferred from the server to the execution engine in
a variety of ways (e.g., network connection, direct communication link or "sneaker net" transfer of readable media,
such as floppy disks or CDs). Accordingly, a preferred embodiment of the present invention described herein is directed
to a computer system with a server and an execution engine wherein the preloaded classes are generated by the
server and subsequently transferred to the execution engine for use in the VM. The preferred embodiment is now
described in reference to FIGS. 3 and 4.

[0027] Referring to FIG. 3, there is shown a distributed computer system 100 in which embodiments of the present
invention may be implemented. The computer system 100 has one or more execution engines 102 and one or more
server computers 104. In a preferred embodiment, each execution engine 102 is connected to the server 104 via the
Internet 106, although other types of communication connections between the computers 102, 104 could be used
(e.g., network connection, direct communication link or "sneaker net" transfer of readable media, such as floppy disks or
CDs). Preferably, the server and execution engines are desktop computers, such as Sun workstations, IBM compatible
computers and/or Apple Macintosh computers; however, virtually any type of computer can be a server or execution
engine. Furthermore, the system is not limited to a distributed computer system. It may be implemented in various
computer systems and in various configurations, or makes or models of tightly-coupled processors or in various
configurations of loosely-coupled microprocessor systems.

[0028] The server computer 104 typically includes one or more processors 112, a communications interface 116, a
user interface 114, and memory 110. The memory 110 stores:

• an operating system 118;

• an Internet communications manager program or other type of network access procedures 120;

• a compiler 122 for translating source code written in the Java programming language into a stream of bytecodes;

• a source code repository 124 including one or more source code files 126 containing Java source code;

• a class file repository 128 including one or more platform-independent class files 130 and one or more class
libraries 131 containing class files, each class file containing the data representing a particular class;

• a class preloader 132 that generates a set of preloaded classes 148 for a particular configuration of the execution
engine (the class preloader is sometimes referred to as a static class loader);

• an assembler 134 that produces an object file representing the class members, class data structures and memory
storage indicators in a format that is recognizable for the linker;

• a linker 136 for determining the memory layout for a set of preloaded classes and for resolving all symbolic
references; and

• one or more data files 146 for use by the server (including the preloaded classes 148).

[0029] Note that the class file repository 128, class preloader 132, assembler 134 and linker 136 need not reside on
the server 104, but can be on any computer whose output (e.g., files or messages representing the preloaded classes
148) can be copied to the execution engine 102.

[0030] An execution engine 102 can include one or more processors 202, a communications
interface 206, a user interface 204, a read-only memory 208 and a random access memory 210. The read-only memory
208 stores program methods that have no unresolved references and program data that remains constant during
program operation. In the preferred embodiment, methods and data stored in the ROM 208 include portions of Java
applications 212 and the execution engine's support procedures. These support procedures include an operating
system 213, network access procedures 214, preloaded classes 232 and internal data structures 1200 (FIG. 2) used by
the preloaded classes 232.

[0031] The random access memory 210 stores:

• a second portion of the Java applications 215 and support procedures 216, 217 that contain methods having
unresolved references and data that is altered during the application's execution; and

• one or more data files 228 that the execution engine may utilize during its processing.

[0032] Referring to FIG. 5, there is shown a flow chart illustrating the sequence of steps used to produce a preloaded
executable module. It should be noted that the method and system described herein pertain to preloading a Java
application and other support procedures. Any Java application, or any other set of methods that are normally linked
at run time, could be preloaded using the method and system described herein.

[0033] The source code 126 for each class that comprises the Java application is compiled by the compiler 122 into
a class file 130, which is a platform-independent representation of the class. As described in reference to FIG. 1, the
class file contains field and method tables, each method's bytecodes, constant data and other information. The
class files corresponding to the application can already reside in one or more class libraries 131. The entire set of
class files 128 that constitute an application to be preloaded are transmitted to the class preloader 132.

[0034] The job of the class preloader is to generate the preloaded classes 148 for an execution engine 102 (FIG. 4).
The preloaded classes 148 include the class block 1202, method blocks 1204, field blocks 1214 and constant pool
1224 described in reference to FIG. 2. Among other things, the class preloader 132 determines which methods and
fields associated with each class 130 can be stored in a read-only memory 208 and which must be stored in a random
access memory device 210. For example, methods that invoke Java interfaces or utilize non-static instance variables
need to reside in random access memory. This is because the bytecodes that implement interfaces are determined at
runtime and non-static instance variables are altered for each instantiation of the associated class.

[0035] The class preloader 132 also performs a number of optimizations in order to produce a more compact internal
representation of the executable code when that code is loaded into the execution engine ROM 208. For example, the
class preloader 132 takes steps to eliminate redundancy in the internal
representation of the class constant pool 310. In accordance with the present embodiment, the class preloader 132
also modifies the internal data structures 1200 (FIG. 2A) to take up less space in the ROM of the execution engine
102. It is an advantage of the present embodiment that this data structure optimization largely frees the internal
representation from inefficient standard data structure formats 1200 used in the prior art.

[0036] The preloaded classes 148 are transmitted to an assembler or compiler 134 that produces an object module
304 having the required format for the linker 136 to map the data into the appropriate address spaces. There
will be two address spaces, one for a random access memory device and a second for a read-only memory device. The
object module 304 is then transmitted to the linker 136, which generates a memory layout for the classes in the
application. Once the memory layout is determined, the linker 136 resolves all symbolic references and replaces them with
direct addresses. The memory layout is partitioned into the two address spaces. The methods and fields that were
flagged for read-only memory are included in the first address space and the methods and data that were flagged as
requiring storage in a random access memory are included in a second address space. The output from the linker 136
is a preloaded executable module 306 containing the methods and data for these two address spaces. The processing
flow of the present embodiment is now described with reference to FIG. 6.

[0037] Referring to FIG. 6, there is shown a data flow diagram of the process employed by the present embodiment
to reduce the memory footprint of internal data structures used by the VM. As already described in reference to FIG.
3, the class preloader 132 generates a set of platform-specific preloaded classes 148 from the class files 128. The
preloaded classes 148 are data structure declarations that can be declared in assembler source or by a high level
language. An assembler 134 or compiler 122 then converts these data declarations to object data 622. The class
preloader 132 also determines the representation of the internal data structures 1200 composing the
preloaded classes 148.

[0038] In the preferred embodiment, a member of the internal data structures can be represented in one of two ways:

1) as a generic memory word (e.g., of 32 bits) that is the value of the member; or

2) as an index to a table of distinct values that can be taken by the member for each occurrence of the member.

[0039] The first representation is the only representation used in the data structures emitted by the prior art class
preloader. This representation can be very inefficient when a particular member for which hundreds or thousands of
occurrences exist only has a few distinct values. In such a situation, a full width memory word (e.g., 32 bits wide) is
allocated for each of the occurrences, taking up as many as thousands of words of scarce storage in the ROM 208,
even though only a few different values are stored. The second representation, which is employed by the present
embodiment, solves this problem by generating a value table 616 to hold the distinct values of such a member and
generating for each occurrence of the member an index of only as many bits as is necessary to address all of the value
table entries. The second representation is advantageous when the memory that would be allocated for the indices
and value table for a particular member type is smaller than the allocated memory required for the generic
representation. The method by which the present embodiment determines how to encode the member data structures is
described below, in reference to FIG. 10.
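
The second representation can be sketched in C as a routine that splits the occurrence values of one member into a value table and one index per occurrence. This is a simplified illustration under assumptions: values are modelled as 32-bit words, each index is stored in a 16-bit slot rather than being packed to the minimum bit width, and the name build_value_table is hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Converts n occurrence values into a table of distinct values plus one
 * index per occurrence. `table` and `index` must each hold n entries.
 * Returns the number of distinct values placed in `table`.
 */
static size_t build_value_table(const uint32_t *values, size_t n,
                                uint32_t *table, uint16_t *index) {
    size_t distinct = 0;
    for (size_t i = 0; i < n; i++) {
        size_t j = 0;
        while (j < distinct && table[j] != values[i])
            j++;                        /* linear search of known values */
        if (j == distinct)
            table[distinct++] = values[i];
        index[i] = (uint16_t)j;         /* occurrence now refers to entry j */
    }
    return distinct;
}
```

After the call, table[index[i]] reproduces the original value of occurrence i, which is the lookup an accessor for an index/table member would perform.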

[0040] Once the determination of how to represent the members is made, the class preloader 132 outputs for each
member to be represented in the index+table format updated header information 614 (including modified member
declarations and accessor functions enabling the VM 246 to access the modified member information) and a respective
value table 616. The header information 614 and value tables 616, which are generated as source code, are compiled
by the compiler 122 along with the virtual machine sources 618 that define the virtual machine to be executed in the
execution engine 102. The linker 136 links the resulting object data 620 and the object data 622 to generate the
preloaded executable module 306, which can be loaded into the execution engine 102. One by-product of the present
embodiment is that, whenever new classes or members are to be incorporated in the preloaded classes 148, a new VM
246 must be generated. This is because the corresponding header information 614 and value tables 616 must be
compiled with the VM sources 618. However, because the present embodiment automatically generates the header
information 614 and member values 616 for any set of classes, generating the new VM requires no or minimal changes
to the VM code. This is because the VM 246 always makes use of the accessor functions that are part of the header
information 614. Thus, the present embodiment is able to generate an efficient representation of data structure
members while facilitating generation of the VM.

[0041] The class preloader 132 is able to generate the efficient index/table member representation because all
possible values of the members are known. As a result, the number of bits needed for each index is also known. The
number of occurrences of each member is also known. Moreover, the preferred embodiment presumes that the class
files 128 represent the complete set of classes that are to be preloaded into the target execution engine 102. This
presumption is especially applicable to execution engines 102 that are small handheld computers, which are unlikely
to have the computing power and/or communications bandwidth to download classes on the fly in the conventional
manner. Given that the number of indices and values are known and that there is no possibility of adding additional
members or classes, it is possible for the class preloader 132 to arrange the indices to have an optimally compact or
near-optimal arrangement when allocated by the execution engine 102. The class preloader 132 achieves this level
of compaction by selecting the order of the indices in the updated header information 614.

[0042] Referring to FIGS. 7A and 7B, there is illustrated the organization of the updated header information 614 and
the value tables 616 along with specific examples of each data structure. These examples represent the outputs
generated by the class preloader 132 corresponding to the data structure declaration 1230.N from FIG. 2B.

[0043] The updated header file 614 shown in FIG. 7A includes a set of data structure declarations 702, each of which
can include, in any combination, updated member declarations 704 and un-updated member declarations 706. Each
data structure declaration 702 corresponds to one of the data structures used by the VM 246. The updated member
declarations 704 are for data structure members that have been modified by the class preloader 132 as index/table
members and the un-updated members 706 are for data structure members that the class preloader 132 determined
were best represented generically. Each data structure declaration 702 is associated with updated member table
declarations 708, updated member accessor functions 710 and un-updated member accessor functions 712. Each updated
member table declaration 708 is associated with a corresponding value table 616 and declares that table in the
appropriate programming language. An updated member accessor function 710 defines the accessor function for updated
(i.e., index/table) members using the table name defined in the respective updated member table declaration 708. The
un-updated member accessor functions 712 are unchanged from those generated by the conventional class preloader
132.

[0044] For example, FIG. 7A shows the updated header file information 614 for the data structure 1230.N (struct T)
from FIG. 2B. This example assumes that the class preloader 132 determined that:

1) member1 has 400 values and is best represented as an index/table member,

2) member2 is best represented conventionally,

3) member3 has 200 values and is best represented as an index/table member,

4) member4 has 1500 values and is best represented as an index/table member, and

5) member5 is best represented conventionally.

[0045] Consequently, the class preloader 132 has generated a modified "struct T" declaration 704 wherein member1
is represented as a 9-bit integer index m1_idx (9 bits being enough to access 400 values), member3 is represented
as an 8-bit integer index m3_idx (enough to access 200 values) and member4 is represented as an 11-bit integer index
m4_idx (enough to access 1500 values). The other members, member2 and member5, are left unmodified as generic
members of type mtype2 and mtype5, respectively.
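
By way of illustration only, the index widths in this example can be checked with a short C sketch. The helper name index_bits is hypothetical and is not part of the class preloader 132 as described; it simply applies the rule that an index addressing N values needs floor(log2(N)) + 1 bits.

```c
#include <assert.h>

/* Hypothetical helper: width in bits of an index that can address
 * n distinct values, following the rule floor(log2(n)) + 1. */
static unsigned index_bits(unsigned n)
{
    assert(n > 0);      /* the rule is undefined for zero values */
    unsigned bits = 0;
    while (n > 0) {     /* count the significant bits of n */
        n >>= 1;
        bits++;
    }
    return bits;        /* equals floor(log2(n)) + 1 */
}
```

Under this rule, index_bits(400), index_bits(200) and index_bits(1500) give the 9-bit, 8-bit and 11-bit widths used for member1, member3 and member4, respectively.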

[0046] The class preloader 132 has also generated an updated member table declaration 708 for member1 showing
that the member1 values are stored in a value table (member1_value[]) of type member1. The member1_value table
is declared as an external variable (extern), which tells the compiler 122 that the actual values of the table are defined
in another file, in this case the value tables file 616. Similar updated member table declarations 708 are generated for
member3 and member4.

[0047] The accessor function 710 for the updated member1 is correspondingly modified so that each time the
corresponding accessor function, member1 of(T), is invoked the VM 246 that accesses the preloaded methods uses the
member1 value (i.e., the 9-bit m1_idx) as an index into the member1_value table. The accessor functions 710 for the
updated member3 and member4 are modified in similar fashion.

[0048] Referring to FIG. 7B, there is shown a representation of the value tables 616, including a table 722.1 that
defines the definite values that can be taken by the member1_value table declared in the header file 614. In this case
the member1_value table is defined as a constant array ("const mtype1 member1_value[]") consisting of 400 values:
val 1, ..., val 400. Similar representations of the value tables for member3 and member4 are also provided (e.g., in the
member 3 and 4 tables 722.3, 722.4).
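
A minimal C sketch of how the header file information 614 and the value tables 616 might fit together for member1 is given below. It is a sketch under stated assumptions rather than the generated code itself: the typedef of mtype1 as int, the three table entries (standing in for the 400 values) and the accessor name member1_of are all hypothetical, and only member1 is shown.

```c
#include <assert.h>

/* Assumption: the description does not define mtype1, so int is used. */
typedef int mtype1;

/* Header side (cf. 614): the table is declared extern, so the compiler
 * expects its contents to be defined elsewhere. */
extern const mtype1 member1_value[];

/* Modified declaration: member1 is stored as a 9-bit index, not a value. */
struct T {
    unsigned m1_idx : 9;
};

/* Hypothetical accessor: resolves the stored index via the value table. */
#define member1_of(t) (member1_value[(t)->m1_idx])

/* Value tables side (cf. 616): the definite values, one per table entry.
 * Three illustrative entries stand in for val 1 ... val 400. */
const mtype1 member1_value[] = { 10, 20, 30 };
```

In this arrangement a caller that holds a struct T whose m1_idx is 2 obtains the third table entry through member1_of, without the struct itself storing a full-width value.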

[0049] Referring to FIG. 8A, there is shown an illustration of the manner in which the internal data structures
(specifically, the member occurrences 802 and value table 806 for a single member type) of the present embodiment are
organized in the execution engine ROM 208. Each of the occurrences represents one occurrence in a preloaded class
of the same member 802 and the data structure type 805 that encompasses it (the data structure type 805 is likely to
include multiple members - e.g., see FIG. 2B). Assuming that a particular member has N distinct values 808 which
are stored in the value table 806, each of the M occurrences 802 of that member is allocated as an index 804 of width
(⌊log2(N)⌋ + 1) bits to the entry of the value table 806 that holds the member's value. For example, each of the occurrences
802.1 and 802.6 is an index to the table entry 806.N. This entry 806.N stores the definite value 808.N associated with
those member occurrences. Thus, the total memory usage of this model is M*(⌊log2(N)⌋ + 1) + value_table_size bits per
member.

[0050] Referring to FIG. 8B, there is shown an illustration of the manner in which the prior art organizes occurrences
852 in the execution engine ROM 208 of a particular member. Each of the occurrences 852 represents one occurrence
in a preloaded class of a particular member. Each of the M occurrences 852 of that member is allocated as a full-width
memory word that stores the value 854 of the member for that occurrence (i.e., each of these occurrences is
represented in the first format referred to above). Thus, the total memory usage of this model is M*32 bits (assuming 32-bit
memory words). As a result, the present embodiment saves memory allocated for a particular member in the data
structures when M*(⌊log2(N)⌋ + 1) + value_table_size is less than M*memory_word_size (e.g., M*32). As in the example
of FIG. 8A, the fields 802 are likely to be just one element in a data structure declaration.

[0051] Referring to FIG. 9A, there is shown an example of how the class preloader 132 of the present embodiment
efficiently stores in the execution engine ROM 208 all of the members 802 of a particular data structure 902 (e.g., the
members of the structure, Struct T 1230.N, FIG. 2B). Generally, the present embodiment packs the stored values
(i.e., the indices 804) so that they occupy as much of a fixed length memory word as possible. In the illustrated situation,
the memory words are 32 bits wide, but the present embodiment is applicable to memory words of any length. In the
example shown in FIG. 9A, the 9-bit, 8-bit and 11-bit members m1, m3 and m4 from Struct T 902 are packed into a
single 32-bit memory word. Values of the members m2, m5, which are represented conventionally (e.g., as 32-bit
values), are stored in respective 32-bit general variables following the first word. In the preferred embodiment these
conventionally-represented members must be aligned on word boundaries (e.g., every 32 bits). There is no such
requirement for the modified members. Therefore, for each data structure instance there are only 4 bits of unused space
904 between the fourth member m4 and the first general variable m2. The class preloader 132 aims to pack the
members of an internal data structure into memory words as efficiently as possible given any combination of member
representations and member sizes.
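
The packing of FIG. 9A can be sketched in C with explicit shifts and masks. This is an illustrative sketch only: the bit offsets chosen for the three indices, the struct name packed_T and the helper names pack_indices and unpack are assumptions, since the description fixes the field widths but not their exact positions within the word.

```c
#include <assert.h>
#include <stdint.h>

/* Index widths taken from the example; the offsets are an assumption. */
enum { M1_BITS = 9, M3_BITS = 8, M4_BITS = 11 };
enum { M1_SHIFT = 0, M3_SHIFT = M1_BITS, M4_SHIFT = M1_BITS + M3_BITS };
enum { UNUSED_BITS = 32 - (M1_BITS + M3_BITS + M4_BITS) };

/* One Struct T instance: one word of packed indices followed by the
 * two conventionally represented, word-aligned members. */
struct packed_T {
    uint32_t indices;  /* m1 index | m3 index | m4 index | unused bits */
    uint32_t member2;  /* conventional 32-bit value */
    uint32_t member5;  /* conventional 32-bit value */
};

/* Pack the three indices into a single 32-bit memory word. */
static uint32_t pack_indices(uint32_t m1, uint32_t m3, uint32_t m4)
{
    assert(m1 < (1u << M1_BITS));
    assert(m3 < (1u << M3_BITS));
    assert(m4 < (1u << M4_BITS));
    return (m1 << M1_SHIFT) | (m3 << M3_SHIFT) | (m4 << M4_SHIFT);
}

/* Extract a field of the given width starting at the given bit offset. */
static uint32_t unpack(uint32_t word, unsigned shift, unsigned bits)
{
    return (word >> shift) & ((1u << bits) - 1u);
}
```

With these widths the three indices occupy 28 of the 32 bits, leaving the 4 unused bits noted above, and the whole instance fits in three 32-bit words.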

[0052] Referring to FIG. 9B, there is shown a diagram illustrating the format of the same data structure Struct T as
stored by the prior art class preloader. Note that, in this system, the data structure requires 5 words to store the 5
members. Thus, the prior art is far less efficient than embodiments of the present invention (which only need 3 words
to store the same data structure information). Embodiments of the method of the present invention are now described
in reference to FIG. 10.

[0053] This arrangement presents no problems to the preloaded classes' use of the accessor functions as the different
memory locations of the indices 804 are resolved by the compiler 122 and the indices themselves store the index of
their associated value 808.

[0054] Referring to FIG. 10, there is shown a flow diagram of an embodiment of the method of the present invention
implemented in the class preloader 132. The present method is implemented in two passes, which include an
accounting pass (represented by the box labeled 1104) and a data structure declaration generation pass (represented by the
rest of the steps). As the first accounting step (performed for all internal data structures), the preloader 132 identifies
all member types of an internal data structure (1106). For example, referring to FIG. 2B, the five members of Struct T
are that data structure's member types. For each member type, the class preloader 132 then performs the following
processing:

Identify M occurrences of the member type (1108).
Identify N values of the M occurrences (1110).
Determine the memory space needed to store each value (1112).
Determine the memory space needed to store an index that can address the N values (the index must be at least
⌊log2(N)⌋+1 bits) (1114).
Determine the size of the conventional representation of the member occurrences (1116).

[0055] This processing is performed on all members of all internal data structures before proceeding with the steps
starting with box 1118. This order of processing is preferable as the accounting statistics generated by the procedures
in the box 1104 are used by the subsequent second pass steps. Typically, the accounting statistics are stored
temporarily for use in the second pass.
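
The accounting pass for a single member type might be sketched in C as follows. The record layout member_stats, the function name account and the assumption that every value and every conventional occurrence is 32 bits wide are illustrative choices, not details taken from the description; the quadratic scan for distinct values is used only to keep the sketch short.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical statistics record kept for the second pass. */
struct member_stats {
    size_t   m;           /* number of occurrences (1108) */
    size_t   n;           /* number of distinct values (1110) */
    unsigned value_bits;  /* space needed for one value (1112) */
    unsigned index_bits;  /* floor(log2(n)) + 1 (1114) */
    unsigned conv_bits;   /* conventional size of one occurrence (1116) */
};

/* Gather the statistics for the m occurrences stored in occ[]. */
static struct member_stats account(const uint32_t *occ, size_t m)
{
    assert(occ != NULL && m > 0);
    struct member_stats s = { .m = m, .n = 0, .value_bits = 32,
                              .index_bits = 0, .conv_bits = 32 };
    for (size_t i = 0; i < m; i++) {
        size_t j = 0;
        while (j < i && occ[j] != occ[i])
            j++;
        if (j == i)
            s.n++;        /* first time this value has been seen */
    }
    for (size_t n = s.n; n > 0; n >>= 1)
        s.index_bits++;   /* count the significant bits of n */
    return s;
}
```

For six occurrences holding the values 7, 3, 7, 9, 3, 7, this sketch records M = 6, N = 3 and a 2-bit index.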

[0056] Once all of the statistics have been generated, the class preloader 132 computes for each member type:

the memory space (LHS) required by the conventional representation of the member occurrences (1120); and
the memory space (RHS) required by the novel representation of each member occurrence as an index to a value
table (1122).

[0057] The class preloader 132 computes the LHS value in step 1120 as follows:

LHS = (size of the conventional representation) x no. of occurrences
    = (size of the conventional representation) x M bits.

[0058] The class preloader 132 computes the RHS value in step 1122 as follows:

RHS = (size of the index) x no. of occurrences + size of the value table
    = M x (⌊log2(N)⌋+1) + (size of the member value) x N bits.



[0059] If the RHS is smaller than the LHS (1124-Y), the class preloader 132 represents that member type as a value
table and indices (1126). If the RHS is not smaller than the LHS (1124-N), the class preloader will represent that member
type conventionally (1128).
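
The comparison of steps 1120 to 1128 can be sketched in C as shown below. The function names are hypothetical, and the sketch assumes, consistently with the description of FIG. 8A, that the RHS is M indices of ⌊log2(N)⌋+1 bits each plus a value table of N entries; the occurrence counts used in the note that follows are invented for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Width of an index addressing n values: floor(log2(n)) + 1 bits. */
static unsigned idx_width(uint64_t n)
{
    assert(n > 0);
    unsigned bits = 0;
    for (; n > 0; n >>= 1)
        bits++;
    return bits;
}

/* LHS: conventional representation of m occurrences, in bits (1120). */
static uint64_t lhs_bits(uint64_t m, unsigned conv_bits)
{
    return m * conv_bits;
}

/* RHS: m indices plus an n-entry value table, in bits (1122). */
static uint64_t rhs_bits(uint64_t m, uint64_t n, unsigned value_bits)
{
    return m * idx_width(n) + n * value_bits;
}

/* Decision 1124: true selects value table and indices (1126),
 * false selects the conventional representation (1128). */
static bool use_index_table(uint64_t m, uint64_t n,
                            unsigned value_bits, unsigned conv_bits)
{
    return rhs_bits(m, n, value_bits) < lhs_bits(m, conv_bits);
}
```

For instance, with 10000 occurrences of a member taking 400 distinct 32-bit values, the RHS is 10000 x 9 + 400 x 32 = 102800 bits against an LHS of 320000 bits, so the index/table form is chosen. With 100 occurrences that are all distinct, the RHS (700 + 3200 bits) exceeds the LHS (3200 bits) and the member stays conventional.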

[0060] The class preloader 132 repeats the steps 1120, 1122, 1124, 1126, 1128 while other members remain to be
processed (1124-N).

[0061] Once each member has been processed (1124-Y), the class preloader 132 performs a data structure
declaration generation procedure 1130. In this procedure, for each data structure the class preloader 132 determines the
optimal ordering of the data structure members (1132). The ordering process and its considerations have already been
described in reference to FIG. 9A. The class preloader 132 then generates the member header information 614 and
values table 616 in accordance with the optimal ordering (1134). The generation of the header information 614 and
member values 616 has already been described in reference to FIGS. 7A and 7B.

[0062] Referring to FIG. 11, the preloaded executable module and boot time initiator 1320 are permanently stored in
the read-only memory of an execution engine computer. Each time the execution engine computer is powered on or
rebooted, the boot time initiator 1320 is automatically executed. Among other tasks, the boot time initiator copies all
methods and data that must be resident in random access memory during execution to the random access memory
locations assigned to them by the linker.

[0063] Although the method and system described herein have been described with reference to the Java
programming language, the present invention is applicable to computer systems using other object-oriented classes that utilize
dynamic runtime loading of classes.

[0064] Further embodiments of the present invention are amenable for execution on various types of executable
mediums [...]. The executable mediums can be [...] a computer-readable storage medium, which can be any memory
device, compact disc or floppy disk.

[0065] The aforementioned system and method have been described with respect to executing a generic Java
application and are applicable to any Java application. For example, the embodiments of the present invention could be
employed to preload classes used by a personal information manager coded in Java intended to run on a handheld
computer. The Java application need not be run in a distributed environment; it can run in stand-alone mode
[...] computer without importing new classes from external systems.
[0066] While embodiments of the present invention have been described with reference to a few specific
embodiments, the description is illustrative of the invention and is not to be construed as limiting the invention. Various
modifications may occur to those skilled in the art without departing from the scope of the invention as defined by the
appended claims.



Claims 



1. A method for reducing memory footprint of preloaded classes to be incorporated into a runtime environment,
comprising the steps of:

determining types of data structures represented in one or more class files used to define a plurality of
preloaded classes, each of the data structure types including one or more members;
determining distinct values that can be taken by each of the members;

storing each of the values for at least a subset of the members selected to reduce the size of corresponding
internal data structures composing the preloaded classes;

generating a set of value indices for addressing stored values stored in the storing step; and
generating accessor functions and member declarations that enable the runtime environment to use the
selected members represented as the stored values and the set of value indices.

2. A system for reducing the memory footprint of preloaded classes to be incorporated into a runtime environment,
comprising:

a set of class files; and

a class preloader configured to generate from the class files a set of preloaded classes and a plurality of
internal data structure declarations configured to minimize size of the preloaded classes when allocated in
memory;

the class preloader being configured to generate the internal data structure declarations so that each of a set
of selected data structure members is represented as an index to a storage structure holding distinct values
of the selected data structure member.

3. A computer program product for use in a computer system for reducing the memory footprint of preloaded classes
to be incorporated into a runtime environment executing on an execution engine, the computer program product
including a computer readable storage medium and a computer program mechanism embedded therein, the
computer program mechanism comprising:

a class preloader configured to generate from a set of class files the preloaded classes and a plurality of
internal data structure declarations configured to minimize size of the preloaded classes when allocated in
memory of the execution engine;

the class preloader being configured to generate the internal data structure declarations so that each of a set
of selected data structure members is represented as an index to a storage structure holding distinct values
of the selected data structure member.

4. A runtime environment built from a collection of preloaded classes defined by one or more internal data structure
types, each including one or more members, the runtime environment comprising:

a storage structure holding distinct values that can be taken by at least a subset of the members; and
an index to a storage structure entry holding the distinct value of a respective member of the subset of the
members serving as each occurrence of the respective member with the distinct value;
such that the runtime environment determines when necessary the distinct value of the respective member
by retrieving contents of the storage structure at a location defined by the index.

5. The runtime environment of claim 4, wherein all of the distinct values are known and the index to the storage
structure entry is implemented using fewest bits able to index all of the distinct values associated with the respective
member.

6. A method for loading an execution engine with preloaded classes to be incorporated into a runtime environment
to be executed on the execution engine, comprising: downloading into the execution engine the preloaded classes,
including a plurality of internal data structure declarations composing the preloaded classes configured so that
each of a set of selected data structure members is represented as an index to a stored distinct value of the
selected data structure member, the set being selected to minimize size of the preloaded classes when allocated
in memory of the execution engine.

7. The method of claim 6, wherein the preloaded classes are downloaded over the Internet.

8. The method of claim 6, further comprising: allocating the preloaded classes in the memory of the execution engine.

9. A method for generating and loading into a client preloaded classes to be incorporated into a runtime environment
to be executed on the client, comprising:

generating in a server the preloaded classes, including a plurality of internal data structure declarations
composing the preloaded classes configured so that each of a set of selected data structure members is
represented as an index to a storage structure holding distinct values of the selected data structure member, the
set being selected to minimize size of the preloaded classes when allocated in memory of the client; and
downloading into the client the preloaded classes.

10. A method for reducing memory footprint of preloaded classes to be incorporated into a runtime environment,
comprising the steps of:

determining distinct values that can be taken by members of internal data structures composing the preloaded
classes;

storing each of the values for at least a subset of the members selected to reduce the size of the internal data
structures; and

replacing each occurrence of a subset member with an index to the value of that occurrence, enabling the
value of that occurrence to be retrieved via the index.

11. A system for reducing the memory footprint of preloaded classes to be incorporated into a runtime environment,
comprising:

a class preloader configured to generate a set of internal data structure declarations configured to minimize
size of the preloaded classes when allocated in memory, the internal data structure declarations declaring
internal data structures composing the preloaded classes;
the class preloader being configured to generate the internal data structure declarations so that each of a set
of selected data structure members is represented as an index to a storage structure holding distinct values
of the selected data structure member.

Struct T {

mtype1 member1
mtype2 member2
mtype3 member3
mtype4 member4
mtype5 member5

}